3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
99 movie screenplays and plot synopses OtherProduction Status:
Existing-used
Use:
Identification of narrative structure of screenplays
-
Paper title:Screenplay Summarization Using Latent Narrative Structure
-
Paper track:Long/Summarization
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Pinelopi Papalampidi | TRIPOD | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:NAT: Noise-Aware Training for Robust Neural Sequence Labeling
-
Paper track:Long/Information Extraction
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marcin Namysl | CoNLL 2003 Shared Task: Language-Independent Named Entity Recognition | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese English French Japanese Korean Russian
Availability:
Freely Available
License:
Size:
5000 sentences Production Status:
Newly created-in progress
Use:
Analysis of cross-linguistic morphosyntactic divergences
-
Paper title:Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dmitry Nikolaev | Aligned sub-corpus of Parallel Universal Dependencies | /N |
Documentation:
None
Written
Named Entity Recognition,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
550 KByte Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling
-
Paper track:Short/Dialogue and Interactive Systems
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zihan Liu | CSB SciTech News NER | /N |
Documentation:
None
Written
Named Entity Recognition,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
3.3 MByte Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling
-
Paper track:Short/Dialogue and Interactive Systems
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zihan Liu | CoNLL 2003 Shared Task Named Entity data | /N |
Documentation:
None
Written
Dialogue Natural Language Understanding,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
2 MByte Production Status:
Existing-used
Use:
Dialogue
-
Paper title:Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling
-
Paper track:Short/Dialogue and Interactive Systems
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zihan Liu | SNIPS-NLU | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Translationese as a Language in "Multilingual" NMT
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Parker Riley | WMT data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
5,384,546 tokens Production Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:Storytelling with Dialogue: {A} {Critical Role Dungeons and Dragons Dataset}
-
Paper track:Long/Summarization
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Revanth Rameshkumar | CRD3: Critical Role Dungeons and Dragons Dataset | /N |
Documentation:
https://github.com/RevanthRameshkumar/CRD3
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons BY-NC-ND 3.0
Size:
20 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Curriculum Pre-training for End-to-End Speech Translation
-
Paper track:Long/Speech and Multimodality
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chengyi Wang | TEDLIUM | /N |
Documentation:
Ambient Search: A Document Retrieval System for Speech Streams
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
1.1 GByte Production Status:
Existing-used
Use:
Natural Language Understanding
-
Paper title:DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
-
Paper track:Short/NLP Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ji Xin | GLUE dataset | /N |
Documentation:
None




